System and method for integrating special effects with a text source

ABSTRACT

Systems, methods, and computer program products relate to special effects for a text source, such as a traditional paper book, e-book, mobile phone text, comic book, or any other form of pre-defined reading material, and for outputting the special effects. The special effects may be played in response to a user reading the text source aloud to enhance their enjoyment of the reading experience and provide interactivity. The special effects can be customized to the particular text source and can be synchronized to initiate the special effect in response to pre-programmed trigger phrases when reading the text source aloud.

FIELD OF THE INVENTION

Embodiments of the present disclosure relate to integrating specialeffects with a text source, and, in particular, with the reading of atext source.

BACKGROUND OF THE INVENTION

Generation of synchronized music and/or audible effects in combinationwith silent reading is described in, for example, U.S. PatentApplication Publication No. 2011/0153047. Such systems, however, aredependent on use of electronic books and algorithms that synchronize auser's reading speed to the sound effects by use of a learning algorithmthat times the sound effects based on the user's reading speed. Further,such systems do not provide an interactive experience, but rather apassive experience, and users complain of the sound effects beingdistracting when silent reading. Further, such systems are limited toproviding audio sound effects.

As such, it would be desirable if there exists a system for integratingspecial effects for text sources, and providing the special effects inresponse to a user reading the text source.

BRIEF DESCRIPTION OF THE INVENTION

Embodiments of the present disclosure relate to special effects for atext source, such as a traditional paper book, e-book, website, mobilephone text, comic book, or any other form of pre-defined text, and anassociated method and system for playing the special effects. Inparticular, the special effects may be played in response to a userreading the text source to enhance the enjoyment of their readingexperience. The special effects can be customized to the particular textsource and can be synchronized to initiate the special effect inresponse to the text source being read.

In one aspect, a system for providing a special effect associated withan auditory input, can include

an electronic mobile device configured to:

-   -   receive an audible input from a user comprising speech of a user        reading one or more portions of a text source;    -   determine whether the audible input matches one or more        pre-determined triggers via a speech recognition algorithm; and    -   in response to determining that the audible input matches the        pre-determined trigger, command a special effect device to        output a plurality of special effects associated with the text        source;

wherein the special effect device comprises an audio speaker and a lightsource, and the at least one of the one or more special effects includesaudio content and light emission;

wherein the plurality of special effects comprises a first specialeffect and a second special effect, wherein the first special effect andthe second special effect are different, and wherein the electronicmobile device is configured to command the special effect output deviceto output the second special effect at least partially concurrently withoutputting the first special effect.

In some embodiments, the text source is pre-existing.

In some embodiments, the text source comprises a book.

In some embodiments, the text source comprises a comic book.

In some embodiments, the text source comprises a printed text source.

In some embodiments, the text source comprises an electronicallydisplayed text source.

In some embodiments, the electronic mobile device is configured tocommand the special effect device to begin outputting the first specialeffect before beginning to output the second special effect.

In some embodiments, the electronic mobile device is configured tocommand the special effect device to stop outputting the first specialeffect before stopping the output of the second special effect.

In some embodiments, wherein the electronic mobile device is configuredto command the special effect device to stop outputting the secondspecial effect before stopping the output of the first special effect.

In some embodiments, wherein the plurality of special effects comprisesa first special effect comprising an audio output, a second specialeffect comprising an audio output, and a third special effect comprisinga light emission.

In some embodiments, wherein a first pre-determined trigger causesoutput of the first special effect and a second pre-determined triggercauses output of the second special effect, wherein the firstpre-determined trigger is different than the second pre-determinedtrigger; and wherein the electronic mobile device is configured todetermine when a pre-determined trigger phrase is detected via a speechrecognition algorithm at least partly concurrently while outputting theplurality of special effects.

In some embodiments, wherein the electronic mobile device iscommunicably coupled but physically distinct from at least one of theone or more special effect devices.

In some embodiments, wherein at least one of the plurality of specialeffects comprises animation.

In some embodiments, at least one of the plurality of special effectscomprises video.

In some embodiments, at least one of the plurality of special effectscomprises a picture.

In some embodiments, the one or more pre-determined triggers compriseactive pre-determined triggers and inactive pre-determined triggers; andin response to determining that the audible input matches an activepre-determined trigger command the system to activate an inactivepre-determined trigger; and, in response to determining that the audibleinput matches the activated pre-determined trigger, command the specialeffect device to output one or more special effects.

In some embodiments, the electronic mobile device is configured todeactivate at least one of the active pre-determined trigger phrasesafter detection of the at least one of the plurality of pre-determinedtrigger phrases such that a user subsequently reading the at least oneof the plurality of pre-determined trigger phrases after detection ofthe at least one of the plurality of pre-determined trigger phrases doesnot result in commanding the special effect output device to output aspecial effect.

In some embodiments, the audible input from a user comprising speech ofa user reading one or more portions of a text source is pre-recorded andelectronically outputted.

In another aspect, a system for providing a special effect associatedwith an auditory input, can include

an electronic mobile device configured to:

-   -   receive an audible input from a user comprising speech of a user        reading one or more portions of a text source;    -   determine whether the audible input matches one or more        pre-determined triggers via a speech recognition algorithm,        wherein the one or more pre-determined triggers comprise active        pre-determined triggers and inactive pre-determined triggers;        and    -   in response to determining that the audible input matches at        least one of the one or more pre-determined triggers, command        one or more special effect devices to output a plurality of        special effects associated with the text source and in response        to determining that the audible input matches an active        pre-determined trigger command the system to activate an        inactive pre-determined trigger;    -   wherein the one or more special effect device comprises an audio        speaker and a light source, and the at least one of the one or        more special effects includes audio content and light emission;        wherein the plurality of special effects comprises a first        special effect comprising an audio output, a second special        effect comprising an audio output different from the first        special effect, and a third special effect comprising a light        emission;    -   wherein the electronic mobile device is configured to command        the special effect output device to output the second special        effect and/or the third special effect at least partially        concurrently with outputting the first special effect;    -   wherein a first pre-determined trigger causes output of the        first special effect and a second pre-determined trigger causes        output of the second special effect and a third pre-determined        trigger causes output of the third special effect, wherein the        first pre-determined trigger is at least partly different than        the second pre-determined trigger; and    -   wherein the electronic mobile device is configured to determine        when a pre-determined trigger phrase is detected via a speech        recognition algorithm at least partly concurrently while        outputting the plurality of special effects.

In yet another aspect, a system for providing a special effectassociated with an auditory input can include

an electronic mobile device configured to:

-   -   receive an audible input from a user comprising speech of a user        reading one or more portions of a text source, wherein the text        source comprises a printed book;    -   determine whether the audible input matches one or more        pre-determined triggers via a speech recognition algorithm; and    -   in response to determining that the audible input matches the        pre-determined trigger, command one or more special effect        devices to output a plurality of special effects associated with        the text source;    -   wherein the one or more special effect device comprises an audio        speaker and a light source, and the at least one of the one or        more special effects includes audio content and light emission;        wherein the plurality of special effects comprises a first        special effect comprising an audio output, a second special        effect comprising an audio output different from the first        special effect, and a third special effect comprising a light        emission;    -   wherein the electronic mobile device is configured to command        the special effect output device to output the second special        effect and/or the third special effect at least partially        concurrently with outputting the first special effect;    -   wherein a first pre-determined trigger causes output of the        first special effect and a second pre-determined trigger causes        output of the second special effect and a third pre-determined        trigger causes output of the third special effect, wherein the        first pre-determined trigger is at least partly different than        the second pre-determined trigger; and    -   wherein the electronic mobile device is configured to determine        when a pre-determined trigger phrase is detected via a speech        recognition algorithm at least partly concurrently while        outputting the plurality of special effects.

BRIEF DESCRIPTION OF THE DRAWINGS

The foregoing summary, as well as the following detailed description ofpreferred embodiments of the invention, will be better understood whenread in conjunction with the appended drawings. For the purpose ofillustrating the invention, there are shown in the drawings embodimentswhich are presently preferred. It should be understood, however, thatthe invention is not limited to the precise arrangements andinstrumentalities shown.

FIG. 1 is a general overview of a system according to an embodiment ofthe present disclosure;

FIG. 2 is a schematic representation of operation of a soundtrackaccording to am embodiment of the present disclosure;

FIG. 3 is a specific example of a soundtrack associated with a textsource according to an embodiment of the present disclosure;

FIG. 4 is a block diagram illustrating an arrangement of soundtrackfiles associating with one or more text sources according to anembodiment of the present disclosure;

FIG. 5 is a block diagram illustrating components of the electronicdevice, and their interaction with other components of the systemaccording to an embodiment of the present disclosure;

FIG. 6 is a example flow chart depicting a method of operation ofplaying sound effects associating with a text source according toembodiments of the present disclosure; and

FIG. 7 is a diagram illustrating various connected devices.

DETAILED DESCRIPTION OF THE INVENTION

Certain terminology is used in the following description for convenienceonly and is not limiting. The words “right”, “left”, “lower”, and“upper” designate directions in the drawings to which reference is made.The terminology includes the above-listed words, derivatives thereof,and words of similar import. Additionally, the words “a” and “an”, asused in the claims and in the corresponding portions of thespecification, mean “at least one.”

A reader's experience may be enhanced if, while reading a text source,special effects, such as, for example, audio sounds, music, lighting,fans, vibrations, air changes, temperature changes, other environmentaleffects and the like, are triggered in synchronization when specificwords or phrases of the text source are read.

For example, a system may be configured to detect, via a speechrecognition module, a particular pre-determined word or phrase of a textsource, process, and output a special effect related to one or moreportions (e.g., feature words) of the text source.

As another example, a system may be configured to detect or estimate thereader's position in a book through an estimation of a reading speed orthrough eye tracking.

Certain embodiments of the present disclosure can synchronize specialeffects and compensate for a time delay by including a system programmedto begin processing and outputting a special effect related to thefeature word prior to the feature word being read by the reader. As aresult, there may appear to be no delay between the time the featureword is read by reader, and the initiation of the special effect relatedto the feature word. Stated differently, the special effect related tothe feature word may be initiated generally simultaneously to the readerreading the feature word, providing an enjoyable “real-time” enhancedreading experience.

Referring now to FIG. 1, a system 100 for generation of a special effectaccording to an embodiment of the disclosure is shown. In the system100, an electronic device 101 may be used (such as by a reader 102) inconjunction with a text source 103, and one or more special effectmodules 104.

The text source 103 may refer to any pre-defined text, such as, forexample, a book such as a children's book, a board book, a chapter book,a novel, a magazine, a comic book, and the like; a script such as amanuscript, a script for a play, movie, and the like; a text message;electronic text, such as text displayed on, for example, a computer, amobile device, a television, or any other electronic display device.

In particular embodiments, the text source 103 can be a traditionalprinted text, also referred to herein as a physical text source. Inother particular embodiments, the text source 103 can be electronicallydisplayed text, also referred to herein as an electronic text source.

In certain embodiments, the text source 103 can be a pre-existing textsource. For example, in certain embodiments, the special effects can becorrelated to the text source long after creation of the text source. Inthis way, special effects can be correlated to any pre-existing textsources including, but not limited to, text printed days, weeks, months,years, or even centuries ago. Thus, no redistribution of the text sourcewith the system 100 is required to provide correlated special effects tothe text source.

The text source 103 may include one or more words, or strings ofcharacters, some of which can be trigger phrases, such as triggerphrases 105, 107, 109, and so on. A trigger phrase 105, 107, 109 mayrefer to one or more words, or string(s) of characters, programmed toelicit one or more responses from one or more components of theelectronic device 101. The text source 103 may also include one or morefeature phrases 111, 113, 115 which may refer to one or more words orphrases to which a special effect may be related.

As discussed above, it may be desirable to have a particular specialeffect played generally simultaneous to the feature phrase being readand/or spoken. The system 100 may be programmed to command one or moreof the special effect output modules 104 to play the special effect upondetection of one or more trigger phrases 105, 107, 109. Therefore, bythe time the processing of the command is complete and an actual specialeffect is output, the reader may be simultaneously reading the featurephrase 111, 113, 115 to which the special effect is related. As such,the system 100 may be programmed to synchronize playback of a desiredspecial effect generally simultaneously with the feature phrase 111,113, 115 being read by initiating playback of the special effect whenone or more trigger phrases 105, 107, 109 are detected. In this context,it is to be understood that “generally simultaneously” refers toimmediately before, during, and/or immediately after the feature phraseis being read.

In certain embodiments, at least one of the words in a feature phrase111, 113, 115 can be the same as at least one of the words in a triggerphrase 105, 107, 109. In other embodiments, no words can overlap betweena feature phrase 111, 113, 115 and a trigger phrase 105, 107, 109.

In further embodiments, the trigger phrase 105, 107, 109 can text bedesigned to be read before the feature phrase 111, 113, 115.

With reference to FIG. 2, a schematic representation of a portion of aspecial effect track 200 for a text source 103 is shown. The specialeffect track 200 may be multi-layered comprising one or more specialeffects that may play separate or concurrently during reading of thetext source 103. Each special effect layer may include one or morespecial effects. By way of example, three special effect layers areshown, although it will be appreciated that any number of layers couldbe provided in other forms of the text source profile. Each of thespecial effect layers 1, 2, and 3 may represent any type of specialeffect, including but not limited to an auditory effect, a visualeffect, an environmental effect, other special effects, and combinationsthereof.

The system 100 may optionally include a second electronic device 117which may include a microphone or other type of audio detector capableof detecting audio. The second electronic device 117 may communicate thedetected audio to the electronic device 101, via any communicationmethod described hereinthroughout, which may include, but is not limitedto: bluetooth, WI-FI, ZIGBEE, and the like. The second electronic device117 may also take the form of a bookmark. As such, the electronic device117 may include a page marking mechanism 119 that is adapted to identifyto a user the current page on the book if reading has stopped part waythrough the book. The second electronic device may include an engagementmechanism 120 that is adapted to secure the second electronic device 117to the text source 103. As shown, the second electronic device 117 isattached to the text source 103. It should be noted, however, that thesecond electronic device 117 may be attached to other elements, such as,for example, the reader 102, or any object.

In certain embodiments, auditory effects can include atmospheric noise,background music, theme music, human voices, animal sounds, soundeffects and the like. For example, atmospheric noise may refer toweather sounds, scene noise, and like. Background music may refer toorchestral music, songs, or any other musical sounds. Other audibleeffects may refer to animal sounds, human voices, doors shutting, andthe like, that are programmed to play upon detection of a triggerphrase.

In certain embodiments, visual effects can include any special effectthat is designed to be viewable by an active and/or passive user. Forexample, visual effects can include visual information on an electronicdisplay such as a computer, a mobile device, a television, a laser, aholographic display, and the like. Further, visual effects can includeanimation, video, or other forms of motion. Still further, visualeffects can include other light sources such as a lamp, a flashlight,such as a flashlight on a mobile device, car lights, Christmas lights,laser lights, and the like.

As used herein, the phrase “environmental special effect” refers to aspecial effect which affects the user's sense of touch, sense of smell,or combinations thereof. For example, an environmental special effectcan include fog generated by a fog machine; wind generated by a fan;vibrations generated by a massaging chair; physical movements of a usergenerated by a movable device, for example a wheel chair; and the like.

As used herein, an environmental special effect does not refer solely toauditory special effects such as sound effect, music and the like.Further, as used herein, an environmental special effect does not refersolely to a visual special effect. Moreover, as used herein, anenvironmental special effect does not refer solely to a combination ofan auditory special effect and a visual special effect.

As shown in the example soundtrack depicted in FIG. 2, upon trigger timeta1, special effect 23 of special effect layer 1 may be programmed tobegin playback at time ta2 (e.g., an approximate time the reader mayread a feature phrase) and to end playback at time ta3. At time tb1,special effect effect 25 of special effect layer 2 may be programmed tobegin playback at time tb2 and to end playback at time tb3. Similarly,upon trigger time tc1, special effect effect 27 of layer 3 may beprogrammed to begin playback at time tc2 and end playback at time tc3.

Trigger times ta1, tb1, tc1 correspond to triggers for respectivespecial effect layers 1, 2, and 3. A trigger time ta1, tb1, or tc1 maycorrespond to a time when the system detects a trigger phrase. Eachspecial effect may be configured to play for any desired length of time,which may be based on one or more various factors. For example, withrespect to a special effect of one of the special effect layers (e.g.,special effect 23 of layer 1), playback end time ta3 may correspond todetection of another trigger phrase of layer 1, playback of anotherspecial effect of layer 1 or detection of another trigger phrase orplayback of another special effect of different layer (e.g., layer 2and/or layer 3). Alternatively, or additionally, a special effect may beprogrammed to play for a predetermined duration of time. For example,each special effect may be configured to play for 3 seconds. As such,for special effect layer 1, ta3 may refer to a time 3 seconds after ta2.As another alternative, each special effect may be configured to playfor a random duration of time (e.g. through the use of a random numbergenerator to randomly generate a time duration for one or more of theaudio effects, as a part of the system 100).

FIG. 3 is a specific example operation of a special effect track for atext source 300. For the sake of simplicity, the text source 300consists of only two sentences. It should be appreciated, however, thatthe text source 300 may be of any length. It should also be noted thatthe trigger phrases may be of any desired length. The trigger phrase 22may be denoted as the sequence of words “[a]s matt walked outside”. Thesystem 100 may detect this trigger phrase at time ta1, and, at time ta2(corresponding to an approximate time when the reader may speak thefeature phrase “rainy”), with respect to special effect layer 1, playspecial effect 23 comprising weather sounds such as rain drops hittingthe ground. Therefore, any processing necessary to output a specialeffect of rainy weather may be performed prior to the reader actuallyreading the feature phrase “rainy”. As a result, the system 100 is ableto play the rainy weather sounds generally simultaneously to the readerreading the feature phrase rainy, providing an enjoyable “real-time”enhanced reading experience.

Another trigger phrase 24 may be the word “cat.” The system 100 maydetect this trigger phrase at time tb1, and, at time tb2, correspondingto an approximate time the reader may be reading the feature phrase“cat,” play the special effect 25 comprising a loop of a sound of acat's meow. Therefore, any system processing necessary to output soundsof cat meowing may be performed prior to the reader actually reading thefeature phrase “meow.” As a result, the system 100 is able to play thespecial effect 25 of a cat's meow generally simultaneously to the readerreading the feature word “meow.”

With respect to special effect layer 3, another trigger phrase 26 of thetext source 30 may be the sequence of words “and a large dog.” Thesystem 100 may detect this trigger phrase at time tc1, and beginplayback of the special effect 27 at time tc2. Therefore, any processingnecessary to output sounds of a dog barking may be performed prior tothe reader actually reading the feature phrase “bark”. As a result, thesystem 100 is able to play the special effect 27 of a dog barkinggenerally simultaneously to the reader reading the feature word “bark”,providing an enjoyable “real-time” enhanced reading experience.

At least because of the multi-layer nature of the special effect track,according to this example, for a period of time, a reader is able toexperience the special effects of rain hitting the ground, a catmeowing, and a dog barking concurrently.

Continuing with this example, the special effect 23 (e.g., the sound ofrain hitting the ground) may be programmed to end playback at time ta3.Similarly, the sound of the cat's meowing may be programmed to end attb3, and the dog's barking may be programmed to end at tc3. It should benoted that one or more layers of the special effect track may also be“pre-mixed.” For example, in response to detection of a single triggerphrase, one or more special effects of special effect layer 1 may bepre-programmed to play for a pre-determined period of time, one or morespecial effects of special effect layer 2 may be pre-programmed to beginplayback after a pre-determined time of the playback of one or moreeffects of special effect layer 1, and one or more audible effects ofspecial effect layer 3 may be pre-programmed to begin playback after apre-determined time after the playback of one or more special effects oflayers 1 and/or 2. The pre-mixed special effect track may be based on anaverage reading speed, which may be updated (e.g., sped up or sloweddown), at any given time by an operator of the system. Further, thesystem 100 may be able to detect and modify the pre-mixed special effecttracks based on user's reading speed ascertained by the system 100.

It should be appreciated that the special effect track may be packagedinto various file formats and arrangements for interpretation andplaying by corresponding special effect software running on a specialeffect player. By way of example only, the special effect track maycomprise a package of special effect files 400. The special effect files400 may include a general data file 401 and one or more layer files 403corresponding to each of the special effect layers of a special effecttrack for one or more text sources 103. It will be appreciated that thatthe special effect track may alternatively comprise only the data files401, 403 and that the layer files may be retrieved from a database ormemory during playing of one or more special effects of one or morelayers of the text source special effect profile, and, in such forms,one or more of the data files 401, 403 may contain linking or file pathinformation for retrieving the special effect files from memory, adatabase, or over a network.

The general data file 401 may comprise general profile data such as, butnot limited to, the name of the special effect track, the name of thetext source with which the profile is associated, and any otheridentification or necessary special effect information or profileinformation. The general data file 401 may also comprise reading speeddata, which may include average reading speed data duration of theoverall text source profile. Additionally, the general data file 401 mayinclude layer data comprising information about the number of specialeffect layers in the text source profile and names of the respectivespecial effect layers. The layer data may also include filenames, filepaths, or links to the corresponding layer data file 403 of each of thespecial effect layers.

Each layer file 403 may include special effects, which may include oneor more special effects for each layer, and the trigger phraseassociated with each of the one or more special effects for each layer.The layer file 403 may also provide information on the particularspecial effect features associated with the one or more special effectsof the respective special effect layer. For example, any predetermineddurations of time of which one or more special effects is set to play,and optionally, other special effect feature properties, such as, forexample, transition effects, such as fade-in, fade-out times, volume,looping, and the like.

It should be appreciated that the above description of the specialeffect files 400 is by way of non-limiting example only. As such, dataand special effects needed for creation and operation of embodiments ofthe disclosure may be stored in any combination of files. For example,one or more layer files 403 may be included in the general file 401, andvice versa.

In certain embodiments, a library of trigger phrases can be arranged ina plurality of discrete databases. For example, a text source, such as abook, can include, 100 pages of text including 10 chapters. Each chaptercan have a discrete database of pre-programmed trigger phrases. Further,at least one trigger phrase can, instead of or in addition to initiatinga special effect, can initiate the system to access a different databaseof trigger phrases for subsequent text that is read, also referred toherein as a database transition trigger phrase. By way of non-limitingexample only, while reading from chapter 1 (to which system 100 mayinclude a corresponding database) of a book, the reader may read atrigger phrase, which may cause the system 100 to output a specialeffect, and, additionally prompt the system 100 to access anotherdatabase corresponding to another chapter (e.g., chapter 2).Alternatively, upon reception of the trigger phrase, the system 100 maynot output a special effect, but simply be prompted to switch to anotherdatabase. It should be noted that databases may correspond to otherportions of text sources in keeping with the invention. For example, asingle chapter of a text source may include multiple databases. Also,databases may simply correspond to different portions of a text sourcethat has no designated chapters.

In other embodiments, essentially all of the trigger phrases can beincluded in a single database. In some embodiments, the system 100 mayinclude functionality to operate with different languages. For example,a first database may include pre-determined trigger phrases of one ormore different languages, which, as used herein may refer to dialects,speech patterns, and the like. Therefore, in operation, one or moreprocessors of the electronic device may be configured to access thefirst database. The system 100 may detect, for example, via a speechalgorithm, the language of the detected pre-determined trigger phrase.Based on the detected pre-determined trigger phrase of the firstdatabase, the system may access a second database which may include aplurality of pre-determined trigger phrases which may be the samelanguage as the detected pre-determined trigger phrase of the firstdatabase. And, in response to determined that at least one of thepre-determined trigger phrases of the second database is detected, thesystem 100 may command a special effect output device to play a specialeffect. It should be noted that the system 100 may include functionalityto operate with any number of languages and databases.

In some embodiments, a text source having a plurality of trigger phrasesmay have different active trigger phrases at given times. For example,at one point in time or portion of reading of the text source, a firstgroup of trigger phrases may be active (e.g., set to effect an action bythe system upon being matched to the auditory input of the text source).At another point in time or portion of reading of the text source, asecond group of trigger phrases may be active. The group of triggerphrases that are active may be referred to as a window, which may changeas subsequent words, or trigger phrases of the text source are read.

For example, a text source may contain words 1-15, and, for the sake ofsimplicity, in this example, a trigger word or phrase corresponds toeach word of the text source. At one point in time, active trigger wordwhich may correspond to words 1-5, respectively, while triggerscorresponding to words 6-15 may currently be inactive. However, afterword “1” is spoken, the window of active trigger word may “slide.” Assuch, triggers for words 2-6 may now become active, and word “1” nowbecomes inactive. After trigger word “2” is spoken, the window may againslide so that words 3-7 are active, while words “1” and “2” becomeinactive, and so on. It should be noted that the designation of activevs. inactive triggers need not be sequential. For example, triggers maybecome active or inactive randomly, or by user, or other means ofdesignation.

In some embodiments, the system 100 may command the output of specialeffects related to a location remote from the general location of thetext source. Such an output of the special effects may be based, atleast in part, on sensory information. The sensory information mayinclude auditory information, visual information, environmentalinformation, or any combination thereof, of the remote location.

For example, one or more special effect tracks may comprise live feeds(e.g., live audio stream) associated with one or more text sources. Forexample, a text source may contain content about the sights and soundsof New York City. One or more portions of a special effect trackassociated with the text source may be configured to have one or moretrigger phrases that elicit one or more special effects in the form ofactual live content, such as audio, video, or environmental effects fromsites or locations around New York City. Such sites may havemicrophones, cameras, sensors, such as temperature sensors, humiditysensors, wind speed sensors, etc. coupled to a network to allow forcommunication with other components of the system 100 to pick up thelive feeds and play the same through the electronic device 101 and/orone or more special effect devices.

FIG. 5 is a perspective block diagram of some of the components of thesystem 100. The system 100 may include a server 501 and the electronicdevice 101 (as discussed above), which may include an input unit 503, aprocessor 505, a speech recognition module 507, a memory 508, a database509, and one or more special effect output modules, such as audio outputmodule 104 which is adapted to produce an auditory special effect. Thedatabase 509 may include one or more special effect track filesassociated with respective text sources (such as, for example, theafore-discussed special effect track files 400).

The audio output module 104 may include a speaker 513, a soundcontroller 515, and various related circuitry (not shown), which maywork with the sound controller 515 to activate the speaker 513 and toplay audio effects stored in the database 509 or locally in the memory508 in a manner known to one of ordinary skill in the art. The processor505 may be used by the audio output module 104 and/or related circuitryto play the audio effects stored in the memory 508 and/or the database509. Alternatively, this functionality may be performed solely by therelated circuitry and the sound controller 515.

The speech recognition module 507 may include a speech recognitioncontroller 517, and other related circuitry (not shown). The input unit503 may include a microphone or other sound receiving device (e.g., anydevice that converts sound into an electrical signal). The speechrecognition controller 517 may include, for example, an integratedcircuit having a processor (not shown). The input unit 503, speechrecognition controller 517, and the other related circuitry, may beconfigured to work together to receive and detect audible messages froma user (e.g., reader) or other sound source (not shown). For example,the speech recognition module 507 may be configured to receive audiblesounds from a reader or other source, such as an audio recording, and toanalyze the received audible sounds to detect trigger phrases. Basedupon the detected trigger phrase (or each detected sequence of triggerphrase(s)), an appropriate response (e.g., special effect) may beinitiated. For example, for each detected trigger phrase, acorresponding special effect may be stored in the memory 508 or thedatabase 509. The speech recognition module 507 may employ at least onespeech recognition algorithm that relies, at least in part, on laws ofspeech or other available data (e.g., heuristics) to identify and detecttrigger phrases, whether spoken by an adult, child, or electronicallydelivered audio, such as from a movie, a TV show, radio, telephone, andthe like.

It should be noted that, in light of embodiments of the disclosureherein, one of ordinary skill in the art may appreciate that the speechrecognition module 507 may be configured to receive incoming audiblesounds or messages and compare the incoming audible sounds to expectedphonemes stored in the speech recognition controller 517, memory 508, orthe database 509. For example, the speech recognition module 507 mayparse received speech into its constituent phonemes and compare theseconstituents against those constituent phonemes of one or more triggerphrases. When a sufficient number of phonemes match between the receivedaudible sounds and the trigger phrase(s), a match is recorded. Infurther embodiments, the speech recognition module 507 may be configuredto receive incoming audible sounds or messages and derive a scorerelating to the confidence of detection of one or more pre-programedtrigger phrases. When there is a match (or high enough score), thespeech recognition module 507, potentially by the speech recognitioncontroller 517 or the other related circuitry activates the correlatedspecial effect.

A reader's experience may further be enhanced through periodic orsporadic updates, changes, and/or other special effect trackalterations. For example, according to embodiments of the presentdisclosure, operators may be able to access the server 501 to provideadditional special effect tracks, add, or otherwise modify existingspecial effect tracks to text sources. A user or reader of a text sourcemay then download or otherwise obtain the updated special effect trackfor a selected text source. As such, the reader may experience differentspecial effects than a previous time the user has read the text source.One or more of the special effect tracks may be modified in other waysas well, adding to the dynamic capabilities of embodiments discussedherein. For example, trigger words of a special effect track may bechanged, added, removed, or otherwise modified for the same text source.The special effect track may also be modified by changing a specialeffect elicited by the same trigger word. Such modifications can beperformed remotely by an operator, such as via the server 501 and thedatabase 509.

Modifications can also be performed through implementation of a randomnumber generator associated with the system 100. For example, the randomnumber generator may seemingly randomly generate numbers correspondingto one or more trigger words to be used with the text source, aparticular special effect to be used in response to a trigger word, andany other aspect of the special effect track to provide the reader oruser with a potentially different experience. As another example, theaforediscussed modifications of trigger phrases, sounds, and the likecan be effected through a pre-programmed sequence. For example, thefirst time the text source is read, one set of trigger words are used,the second time, another set is used, and a subsequent time, another setis used, and so on. Further, special effects can be programmed tosequentially change as well. Even still, any other aspect or combinationof the came can be programmed to be sequentially modified. Accordingly,different experiences can be had each time a text source is read.

In some embodiments, the electronic device 101, for example, via aspeech recognition algorithm, listen for trigger phrases output bysources other than the user, such as, for example, a TV show, a movie,radio, internet content, and the like). Upon reception of a matchedtrigger phrase, associated content may be displayed on the electronicdevice 101. For example, a user may be watching television, and a BMWcommercial plays. The system 100 may have a pre-determined triggerphrase for detection of a portion of the BMW commercial. When the system100 detects the phrase from the BMW commercial, associated content(e.g., an associated advertisement) from BMW may appear on theelectronic device 101. Further, the user may click, or otherwise selectthe advertisement from the electronic device 101 and receive morecontent pertaining to BMW.

In some embodiments, the speaker 513 may be distanced, or otherwisedecoupled from the microphone. For example, the speaker 513 may becommunicably coupled to the electronic device via a bluetooth, NFC, orany other wireless or wired means capable of allowing for communicationbetween the speaker 513 and the microphone. Such a configuration mayensure the microphone of the system 100 picks up audible messages spokenby the reader and not audible effects output by the speaker 513. Thesystem 100 may also employ one or more filters to filter or otherwiseblock the output audible effects from the speaker 513. Such filteringmay be possible due at least in part to the fact that the system 100knows which audible effects are currently being output. As such, one ormore filters knows exactly what audible sounds need to be filtered.

The system 100 may include a communication network 514 which operativelycouples the electronic device 101, the server 501, and the database 509.The communication network 514 may include any suitable circuitry,device, system, or combination of these (e.g., a wireless or hardlinecommunications infrastructure including towers and communicationsservers, an IP network, and the like) operative to create thecommunication network 514. The communication network 514 can provide forcommunications in accordance with any wired or wireless communicationstandard. For example, the communication network 514 can provide forcommunications in accordance with second-generation (2G) wirelesscommunication protocols IS-136 (time division multiple access (TDMA)),GSM (global system for mobile communication), IS-95 (code divisionmultiple access (CDMA)), third-generation (3G) wireless communicationprotocols, such as Universal Mobile Telecommunications System (UMTS),CDMA2000, wideband CDMA (WCDMA) and time division-synchronous CDMA(TD-SCDMA), 3.9 generation (3.9G) wireless communication protocols, suchas Evolved Universal Terrestrial Radio Access Network (E-UTRAN), withfourth-generation (4G) wireless communication protocols, internationalmobile telecommunications advanced (IMT-Advanced) protocols, Long TermEvolution (LTE) protocols including LTE-advanced, or the like. Further,the communication network 514 may be configured to provide forcommunications in accordance with techniques such as, for example, radiofrequency (RF), infrared, or any of a number of different wirelessnetworking techniques, including WLAN techniques such as IEEE 802.11(e.g., 802.11a, 802.11b, 802.11g, 802.11n, etc.), wireless local areanetwork (WLAN) protocols, world interoperability for microwave access(WiMAX) techniques such as IEEE 802.16, and/or wireless Personal AreaNetwork (WPAN) techniques such as IEEE 802.15, BlueTooth™, ultrawideband (UWB) and/or the like.

The electronic device 101 may refer to, without limitation, one or morepersonal computers, laptop computers, personal media devices, displaydevices, video gaming systems, gaming consoles, cameras, video cameras,MP3 players, mobile devices, wearable devices (e.g., iWatch by Apple,Inc.), mobile telephones, cellular telephones, GPS navigation devices,smartphones, tablet computers, portable video players, satellite mediaplayers, satellite telephones, wireless communications devices, orpersonal digital assistants (PDA). The electronic device may also referto one or more components of a home automation system, appliance, andthe like, such as AMAZON ECHO. It should be appreciated that theelectronic device may refer to other entities different from a toy suchas a doll, or a book.

In the description hereinthroughout, the term “app” or “application” or“mobile app” may refer to, for example, an executable binary that isinstalled and runs on a computing device, or a web site that the usernavigates to within a web browser on the computing device, or acombination of them. An “app” may also refer to multiple executablebinaries that work in conjunction on a computing device to perform oneor more functions. It should be noted that one or more of the abovecomponents (e.g., the processor, the speech recognition module 507) maybe operated in conjunction with the app as a part of the system 100.

The various illustrative logical blocks and modules described herein maybe implemented or performed with a general-purpose processor, anapplication specific integrated circuit, (ASIC) a digital signalprocessor, a field programmable gate array (FPGA) or other programmablelogic device, discrete gate or transistor logic, discrete hardwarecomponents, or any combination thereof. A general-purpose processor maybe a microprocessor, but in the alternative, the processor may be anyconventional processor, controller, microcontroller, or state machine. Aprocessor may also be implemented as a combination of computing devices,e.g., a combination of a digital signal processor and a microprocessor,a plurality of microprocessors, one or more microprocessors inconjunction with a digital signal processor core, or any other suchconfiguration.

The steps of a method or algorithm described in connection with thedisclosure herein may be embodied directly in hardware, in a softwaremodule executed by a processor, or in a combination of the two. Asoftware module may reside in RAM memory, flash memory, ROM memory,EPROM memory, EEPROM memory, registers, hard disk, a removable disk, aCD-ROM, or any other form of storage medium known in the art. Anexemplary storage medium is coupled to the processor such that theprocessor can read information from, and write information to, thestorage medium. In the alternative, the storage medium may be integralto the processor. The processor and the storage medium may reside in anASIC. The ASIC may reside in a computing device. In the alternative, theprocessor and the storage medium may reside as discrete components in acomputing device.

In one or more exemplary designs, the functions described may beimplemented in hardware, software, firmware, or any combination thereof.If implemented in software, the functions may be stored on ortransmitted over as one or more instructions or code on acomputer-readable medium, and preferably on a non-transitorycomputer-readable medium. Computer-readable media includes both computerstorage media and communication media including any medium thatfacilitates transfer of a computer program from one place to another. Astorage media may be any available media that can be accessed by ageneral purpose or special purpose computer. By way of example, and notlimitation, such computer-readable media can comprise RAM, ROM, EEPROM,CD-ROM or other optical disk storage, magnetic disk storage or othermagnetic storage devices, or any other medium that can be used to carryor store desired program code means in the form of instructions or datastructures and that can be accessed by a general-purpose orspecial-purpose computer, or a general-purpose or special-purposeprocessor. Also, any connection is properly termed a computer-readablemedium. For example, if the software is transmitted from a website,server, or other remote source using a coaxial cable, fiber optic cable,twisted pair, digital subscriber line (DSL), or wireless technologiessuch as infrared, radio, and microwave, then the coaxial cable, fiberoptic cable, twisted pair, DSL, or wireless technologies such asinfrared, radio, and microwave are included in the definition of medium.Disk and disc, as used herein, includes compact disc (CD), laser disc,optical disc, digital versatile disc (DVD), floppy disk and blu-ray discwhere disks usually reproduce data magnetically, while discs reproducedata optically with lasers. Combinations of the above should also beincluded within the scope of computer-readable media.

FIG. 6 is an example flow diagram 600 illustrating operation of one ofthe embodiments of the present disclosure. At block 601, a user mayinitiate the special effect system 100. Such initiation may take theform of logging onto an associated application on the electronic device101. After initiation of the system 100, at block 603, the user mayidentify a text source she wishes to read aloud. Identification of thetext source may be performed by the user entering a title of a textsource, browsing for a text source title, or audibly speaking the nameof a text source title. Also using the electronic device 101, the usermay read a bar code, QR code, or other unique identifying mark on a textsource, the title of which may be extracted therefrom, for use with theapp. At block 605, the system 100 (e.g., the server 501 or anapplication running locally) may check if a special effect track existsfor the selected text source. If no soundtrack exists, at block 607, thesystem 100 (e.g., the server 501 or an application running locally) mayprompt the user to select another text source for reading. This processmay repeat until a text source is selected for which there exists aspecial effect track, at which time the process continues to block 609.At block 609, one or more special effect track files associated with theselected text source may be loaded onto the electronic device. One ofmore of the special effect track files may be downloaded (e.g., from thedatabase 509 via the server or the memory 508) to the electronic device101 (e.g., onto the memory 508). Alternatively, one or more specialeffect track files, or any portion thereof, may be retrieved from thedatabase via server 501 during reading of the selected text source.

At block 611, the speech recognition module 507 may be activated toreceive audible input from the reader, via the microphone of the inputunit 503. At block 613, the application continuously picks up on audiblemessages, checks the audible input for matches to one or more triggerphrases. Such a check may include comparing the spoken word(s) to wordsearchable files having an associated audio effect or soundtrack. Infurther embodiments, such a check may include comparing the spokenword(s) to a database of pre-programmed trigger phrases and delivering anumerical confidence score of a keyword being detected. Upon detection(or a score high enough to count as a detection), at block 615, thesystem 100 plays the special effect associated with the detected one ormore trigger phrases. The system 100 continues to listen for audiblemessages continuously during playing of the special effect(s). Thesystem 100 continues to listen for audible messages until the end of thetext source is reached, another text source is loaded, or the system 100is deactivated.

It should be appreciated that other embodiments are contemplated. Forexample, one or more components of the system 100 may be employed forintegration with movie or other media content. For example, the system100 may be programmed to output special effects in response to audiblemessages from currently playing media content. These effects may beaudible or other forms depending on other connected devices. Forexample, and as reflected in FIG. 7, the system 100 may be programmed toqueue one or more special effects generated by one or more internet ofthings (IoT) devices 701 which may be embedded with electronics,software, and/or sensors with network connectivity, which may enable theobjects to collect and exchange data. For example, these IoT devices 701may potentially be associated with home automation which may include butare not limited to lights, fans, garage doors, alarm systems, heatingand cooling systems, doorbells, microwaves, and refrigerators, that mayallow the system 100 to access, control, and/or configure the same togenerate a special effect or prompt another IoT device to generate aspecial effect.

Similar to the above discussed creation of special effect tracks fortext sources, operators may create special effect tracks customized forany movie or other media content. Such interactively with home devicesmay operate both ways. For example, in response to one or more specialeffects output by the aforediscussed home devices, media content playedfrom (e.g., a video player), may be controlled (e.g., paused, sped up,slowed down, skipped through, changed) directly through such homedevices. Accordingly, one or more embodiments of the present disclosureallow for the communication of objects, animals, people, and the like(e.g., through a network) without human-to-human or human-to-computerinteraction.

Embodiments of the present disclosure may also allow for thecommunication of any of the afore-discussed devices through use ofpre-defined audio sources. For example, operators of the system 100 mayprogram audio sources to configure a connected device to operate in adesired manner. The pre-defined audio may be any content, such astelevision, radio, audiobook, music, and the like. Any connected devicewithin a distance capable of detecting audio from pre-defined audiosource can communicate and interact in a predetermined manner.

Certain embodiments of the present disclosure may also relate toadvertising. For example, a person may be watching programming (e.g., ona television) and a car advertisement is played. The person may have anelectronic device in close proximity. As such, the electronic device(e.g., speech recognition module) may detect the car advertisementthrough, for example, pre-programmed trigger words that are playedduring the advertisement. Consequently, the system may be configured todetect that the user is listening to the advertisement in real time.Further, the system may be configured to present (on the electronicdevice) corollary content, such as another advertisement or other mediarelated to the detected car advertisement that was displayed on thetelevision.

Further, certain embodiments of the present disclosure may relate todata analytics allowing a user of an electronic device to ascertaininformation about other users and associated electronic device throughuse of the aforediscussed system. For example, the system may beconfigured to determine if and when a user has been within an audibledistance of an advertisement as discussed above. Such a system can havethe words played in the advertisement pre-programmed as trigger words,and thereby determine if and when the trigger words are detected thussignaling that a user has heard the advertisement.

It will be appreciated by those skilled in the art that changes could bemade to the embodiments described above without departing from the broadinventive concept thereof. It is understood, therefore, that thisinvention is not limited to the particular embodiments disclosed, but itis intended to cover modifications within the spirit and scope of thepresent invention as defined by the appended claims.

One example aspect is directed to a system for providing a specialeffect with an auditory input. The system comprises of an electronicdevice. The electronic device is configured to receive an audible inputfrom a user, determine whether the audible input matches apre-determined trigger, and if the audible input matches thepre-determined trigger, command a special effect device to output one ormore special effects associated with the auditory input.

In some implementations, the special effect device comprises an audiospeaker, and at least one of the one or more special effects includesaudio content.

In some implementations, the special effect device comprises a homeappliance. The home appliance includes a light source, and at least oneof the one or more special effects includes light emission.

In some implementations, the special effect device comprises an audiospeaker and a light source, and the at least one of the one or morespecial effects includes audio content and light emission.

In some implementations, the special effect device comprises a homeappliance. The home appliance includes a light source, an audio speaker,a fan, a garage door, an alarm system, a heating or cooling system, adoorbell, a microwave, a refrigerator, or combinations thereof.

In some implementations, the pre-determined trigger comprises one ormore trigger words corresponding to one or more trigger words of a textsource.

In some implementations, the one or more trigger words of the textsource are spoken prior to one or more feature words. The electronicdevice is configured to output one or more special effects generallysimultaneously to an audible output of at least one of the one or morefeature words.

One example aspect is directed to a system for providing a specialeffect associated with an auditory input. The system comprises anelectronic device. The electronic device is configured to receive anaudible input of one or more portions of a text source. The electronicdevice is also configured to access a plurality of pre-determinedtriggers. The pre-determined triggers comprise active pre-determinedtriggers and inactive pre-determined triggers. In response todetermining that the audible input matches an active pre-determinedtrigger, the electronic device is configured to command a special effectdevice to output one or more special effects associated with theauditory input; or command the system to activate at least one of theplurality of inactive pre-determined triggers, and, in response todetermining that the audible input matches the activated pre-determinedtrigger, command the special effect device to output one or more specialeffects associated with the auditory input.

In some implementations, the active pre-determined triggers correspondto one or more words of a first portion of the text source, and theinactive pre-determined triggers correspond to one or more words of asecond portion of the text source.

In some implementations, one or more of the words of the first andsecond portions are the same.

In some implementations, the first portion is different than the secondportion.

In some implementations, the first portion is associated with onechapter of the text source, and the second portion is associated withanother second chapter of the text source.

In some implementations, in response to determining that the audibleinput matches an active pre-determined trigger, the electronic device isconfigured to command a special effect device to output one or morespecial effects associated with the auditory input, and command thesystem to activate at least one of the plurality of inactivepre-determined triggers; and, in response to determining that theaudible input matches the activated pre-determined trigger, command thespecial effect device to output one or more special effects associatedwith the auditory input.

One example aspect is directed to a system for providing a specialeffect associated with an auditory input. The system comprises anelectronic device. The electronic device is configured to receive anaudible input of one or more portions of a text source. The electronicdevice is also configured to access a plurality of pre-determinedtriggers comprising active pre-determined triggers and inactivepre-determined triggers. The electronic device is also configured to, inresponse to determining that the audible input matches an activepre-determined trigger, command a first special effect device to outputone or more first special effects associated with the auditory input, orcommand the system to activate at least one of the plurality of inactivepre-determined triggers; and, in response to determining that theaudible input matches the activated pre-determined trigger, command thefirst special effect device or a second special effect device to outputone or more first special effects associated with the auditory input.The electronic device is also configured to command the first specialeffect device to play a second special effect in response to determiningthat the one or more of the plurality of active or activatedpre-determined triggers was detected.

In some implementations, the one or more active or activatedpre-determined triggers causing output of the first special effect isdifferent from the one or more pre-determined triggers causing output ofthe second special effect. In some implementations, the second specialeffect is output at least partly concurrently with the output of thefirst special effect.

In some implementations, the first or second special effect devicecomprises an audio speaker, and at least one of the one or more specialeffects includes audio content.

In some implementations, the first or second special effect devicecomprises a home appliance. The home appliance includes a light source.At least one of the one or more special effects includes light emission.

In some implementations, the first or second special effect devicecomprises an audio speaker and a light source. The at least one of theone or more special effects includes audio content and light emission.

In some implementations, the second special effect is output from thesecond special effect device.

In some implementations, the plurality of pre-determined triggerscomprises one or more trigger words corresponding to one or more triggerwords of the text source. In some implementations, one or more triggerwords of the text source are spoken prior to one or more feature words.In some implementations, the electronic device is configured to outputone or more special effects generally simultaneously to an audibleoutput of at least one of the one or more feature words.

One example aspect is directed to a system for providing special effectsin response to a user reading aloud a text source comprising a pluralityof feature phrases and a plurality of trigger phrases. The systemcomprises one or more processors. The one or more processors areconfigured to access a database comprising a plurality of pre-determinedtrigger phrases, determine when one or more of the plurality ofpre-determined trigger phrases is detected via a speech recognitionalgorithm, and command a special effect output device to play a specialeffect in response to determining that the pre-determined trigger wasdetected.

One example aspect is directed to a system for providing special effectsin response to a user reading aloud a text source. The system comprisesone or more processors. The one or more processors are configured toaccess a database comprising a plurality of pre-determined triggerphrases, determine when one of more of the plurality of pre-determinedtrigger phrases is detected via a speech recognition algorithm, andcommand a special effect output device to play a special effect inresponse to determining that the pre-determined trigger was detected,concurrently with playing the special effect, determine when a secondpre-determined trigger phrase is detected via the speech recognitionalgorithm.

In some implementations, a plurality of pre-determined trigger phrasesare correlated to a single special effect. In some implementations, inresponse to determining that one of the plurality of pre-determinedtrigger phrases are detected, the system plays the single specialeffect. In some implementations, the remainder of the plurality ofpre-determined trigger phrases are deactivated from outputting thesingle special effect.

In some implementations, the one or more processors are configured tooutput the special effect in response to determining that two or more ofthe plurality of pre-determined trigger phrases are detected.

In some implementations, the one or more processors are configured todeactivate at least one of the plurality of pre-determined triggerphrases after detection of the at least one of the plurality ofpre-determined trigger phrases such that a user subsequently reading theat least one of the plurality of pre-determined trigger phrases does notresult in commanding the special effect output device to output aspecial effect.

In some implementations, the one or more processors are configured tokeep at least one of the plurality of pre-determined trigger phrasesactive after detection such that a user subsequently reading the atleast one of the plurality of pre-determined trigger phrases afterdetection of the at least one of the plurality of pre-determined triggerphrases results in commanding the special effect device to output aspecial effect again.

In some implementations, the one or more processors are configured tokeep at least one of the plurality of pre-determined trigger phrasesactive after detection. In some implementations, upon a subsequentdetection of the at least one of the plurality of pre-determined triggerphrases, the processors are configured to command the special effectoutput device to output a different special effect than was outputtedupon the first detection of the at least one of the plurality ofpre-determined trigger phrases.

In some implementations, the one or more processors are configured totemporarily deactivate for a pre-defined time period at least one of theplurality of pre-determined trigger phrases after detection of the atleast one of the plurality of pre-determined trigger phrases such that auser subsequently reading the at least one of the plurality ofpre-determined trigger phrases after detection of the at least one ofthe plurality of pre-determined trigger phrases and during its temporarydeactivation does not result in commanding the special effect outputdevice to output a special effect. In some implementations,subsequently, after the pre-defined time period has elapsed, the one ormore processors are configured to reactivate the at least one of theplurality of pre-determined trigger phrases such that a subsequentdetection of the at least one of the plurality of pre-determined triggerphrases, command the special effect output device to output a differentspecial effect than outputting upon the first detection of the at leastone of the plurality of pre-determined trigger phrases.

In some implementations, the one or more processors are furtherconfigured to command the special effect output device to play a secondspecial effect concurrently with the first special effect.

In some implementations, the one or more processors are furtherconfigured to determine when a third pre-determined trigger phrase isdetected via the speech recognition algorithm concurrently with playingthe first and second special effect.

In some implementations, the one or more processors are furtherconfigured to command the special effect output device to play a thirdspecial effect in response to determining that the pre-determinedtrigger was detected.

One example aspect is directed to a system for providing special effectsin response to a user reading aloud a text source. The system comprisesone or more processors configured to access a database comprising aplurality of pre-determined trigger phrases. The system comprises theability to determine when one or more of the plurality of pre-determinedtrigger phrases is detected via a speech recognition algorithm. Thesystem comprises the ability to command a special effect output deviceto play a first special effect in response to determining that one ormore of the plurality of pre-determined triggers was detected. Thesystem comprises the ability to command the special effect output deviceto play a second special effect in response to determining that one ormore of the plurality of pre-determined trigger phrases was detected. Insome implementations, one or more of the plurality of pre-determinedtrigger phrases causing output of the first special effect is differentfrom the one or more of the plurality of pre-determined trigger phrasescausing output of the second special effect. In some implementations,the second special effect is outputted at least partly concurrently withoutputting the first special effect. In some implementations, one ormore processors are configured to determine when a pre-determinedtrigger phrase is detected via a speech recognition algorithmconcurrently while playing the special effect.

One example aspect is directed to a system for providing special effectsin response to a user reading aloud a text source. The system comprisesa processor. The processor is configured to determine one or more of aplurality of databases to access. Each database comprises a plurality ofpre-determined trigger phrases. The processor is configured to accessone or more of a plurality of databases. The processor is configured todetermine when one or more of the pre-determined trigger phrases isdetected via a speech recognition algorithm. In some implementations, inresponse to determining that one or more of the pre-determined triggerphrases was detected, the processor is configured to command a specialeffect output device to output a special effect. In someimplementations, in response to determining that one or more of thepre-determined trigger phrases was detected, the processor is configuredto access a different database of pre-determined trigger phrases.

In some implementations, one or more of the pre-determined triggerphrases causing the outputting of a special effect is the same as one ormore of the pre-determined trigger phrases causing the system to accessa different datable of pre-determined trigger phrases.

In some implementations, one or more of the pre-determined triggerphrases causing the outputting of a special effect is different than oneor more of the preOdetermined trigger phrases causing the system toaccess a different database of pre-determined trigger phrases.

One example aspect is directed to a system for providing special effectsin response to a user reading a text source aloud. The system comprisesone or more processors. The one or more processors are configured toaccess a database comprising a plurality of pre-determined triggerphrases. The plurality of pre-determined trigger phrases are alsophrases in the text source. The one or more processors are configured todetermine when one or more of the plurality of pre-determined triggerphrases is detected via a speech recognition algorithm when the userreads the text source aloud, and command a special effect output deviceto play one of a plurality of special effects in response to determiningthat the pre-determined trigger was detected. The one or more processorsare configured to, upon a subsequent accessing of the database, commandthe special effect output device to play a different special effectamong the plurality of special effects in response to determining thatthe same pre-determined trigger phrase was detected.

One example aspect is directed to a system for providing special effectsin response to a user reading a text source. The system comprises one ormore processors. The one or more processors are configured to access afirst database comprising a first plurality of pre-determined triggerphrases. The one or more processors are configured to determine when oneor more of the first plurality of pre-determined trigger phrases isdetected via a speech recognition algorithm. The one or more processorsare configured to command a special effect output device to play one ofa plurality of special effects in response to determining that the oneor more of the first plurality of pre-determined triggers was detected.The one or more processors are configured to, upon a subsequentinitiation of the system, access a second database comprising a secondplurality of pre-determined trigger phrases. In some implementations, atleast one pre-determined trigger phrase in the first database isdifferent from the second plurality of pre-determined trigger phrases.

One example aspect is directed to a system for providing dynamic specialeffects in response to a user reading a text source aloud. The systemcomprises one or more processors. The one or more processors areconfigured to access a database comprising a plurality of pre-determinedtrigger phrases. In some implementations, the plurality ofpre-determined trigger phrases are also phrases in the text source. Theone or more processors are configured to determine when one or more ofthe plurality of pre-determined trigger phrases is detected via a speechrecognition algorithm when the user reads the text source aloud. The oneor more processors are configured to command a special effect device toplay one of a plurality of special effects in response to determiningthat the pre-determined trigger was detected. The one or more processorsare configured to command the special effect output device to play oneof a plurality of special effects in response to determining that apre-determined quantity or sequence of pre-determined trigger phrasesare detected.

In some implementations, the text source comprises a physical textsource.

In some implementations, the text source comprises an electronicallydisplayed text source.

One example aspect is directed to a system for providing special effectsin response to a text source being read aloud. The system comprises ofone or more processors. The one or more processors are configured toaccess a database comprising a plurality of pre-determined triggerphrases. The one or more processors are configured to determine when oneor more of the plurality of pre-determined trigger phrases is detectedfrom an electronically delivered auditory speech. The one or moreprocessors are configured to command a special effect output device toplay a special effect in response to determining that the pre-determinedtrigger was detected.

In some implementations, the electronically delivered auditory speech ispre-recorded.

In some implementations, the electronically delivered auditory speech isthe form of an audiobook.

In some implementations, the electronically delivered auditory speech isthe form of a movie.

In some implementations, the electronically delivered auditory speech islive.

In some implementations, the electronically delivered auditory speech isdelivered from a live user.

In some implementations, the electronically delivered auditory speech isdelivered from a recording of a live user.

In some implementations, the one or more processors are furtherconfigured to determine when one or more of the plurality ofpre-determined trigger phase is detected from live auditory speechdelivered by a user. In some implementations, the one or more processorsare further configured to command a special effect output device to playa special effect in response to determining that the pre-determinedtrigger was detected.

In some implementations, the text source is in the form of a physicaltext source.

In some implementations, the physical text source is in the form of aphysical book.

In some implementations, the text source is in the form of anelectronically displayed text source.

In some implementations, at least one special effect comprises anauditory special effect.

In some implementations, at least one special effect comprises a visualspecial effect.

In some implementations, at least one special effect comprises anenvironmental special effect.

In some implementations, the special effect is correlated to the textsource.

In some implementations, the text source further comprisesillustrations, and at least one special effect is correlated to theillustrations.

In some implementations, at least two special effects are outputtedconcurrently.

In some implementations, at least one special effect output device isphysically distinct from one or more processors.

In some implementations, at least one special effect is outputtedgenerally in real-time with the auditory speech comprising thepre-determined trigger phase is delivered.

One example aspect is directed to a system for providing special effectsin response to a physical text source being read aloud. The systemcomprises of one or more processors. The one of more processors areconfigured to access a database comprising a plurality of pre-determinedtrigger phrases. The one or more processors are configured to determinewhen one or more of the plurality of pre-determined trigger phrases isdetected from an electronically delivered auditory speech in the form ofan audiobook. The one or more processors are configured to command aspecial effect output device to play a special effect in response todetermining that the pre-determined trigger was detected.

In some implementations, the special effect comprises an auditoryspecial effect.

In some implementations, the special effect comprises a visual specialeffect.

In some implementations, the special effect comprises an environmentalspecial effect.

One example aspect is directed to a system for providing special effectsin response to a media. The media comprises visual and auditorycomponents being played. The system comprises one or more processors.The one or more processors are configured to access a databasecomprising a plurality of pre-determined trigger phrases. The one ormore processors are configured to determine when one or more of theplurality of pre-determined trigger phrases is detected from anelectronically delivered auditory speech from a media comprising visualand auditory components. The one or more processors are configured tocommand a special effect output device to play a special effect inresponse to determining that the pre-determined trigger was detected. Insome implementations, the special effect is correlated to the media.

In some implementations, at least one special effect comprises anenvironmental special effect.

In some implementations, one or more processors are configured tocommand a special effect output device to play a special effectgenerally in real-time with the trigger phrase being delivered from themovie.

One example aspect is directed to a system for providing special effectsin response to a user reading a text source. The system comprises one ormore processors. The one or more processors are configured to access adatabase comprising a plurality of pre-determined trigger phrases. Theone or more processors are configured to determine when one or more ofthe plurality of pre-determined trigger phrases is detected from anelectronically delivered auditory speech from a media comprising visualand auditory components. The one or more processors are configured tocommand a special effect output device to play a special effect inresponse to determining that the pre-determined trigger was detected. Insome implementations, the special effect is correlated to the media.

One example aspect is directed to a system for providing special effectsin response to a text source being read aloud. The system comprises oneor more processors. The one or more processors are configured to accessa database comprising a plurality of pre-determined trigger phrases. Theone or more processors are configured to determine when one or more ofthe plurality of pre-determined trigger phrases is detected from atleast one of a plurality of auditory sensors. In some implementations,the plurality of auditory sensors are in different locations. The one ormore processors are configured to command a plurality of special effectoutput devices to each generally simultaneously play a special effect inresponse to determining that the pre-determined trigger was detected.

In some implementations, one or more processors are configured tocommand a plurality of special effect output devices to each generallysimultaneously play the same special effect in response to determiningthat the pre-determined trigger was detected.

In some implementations, one or more processors are configured tocommand a plurality of special effect output devices to each generallysimultaneously play different special effects in response to determiningthat the pre-determined trigger was detected.

One example aspect is directed to a system for providing special effectsin response to a text source being read aloud. The system comprises oneor more processors. The one or more processors are configured to receivean auditory input from a first user. The one or more processors areconfigured to receive an auditory input from a second user. The one ormore processors are configured to command a first special effect outputdevice. In some implementations, the first special effect output deviceis in communication with the first user. The one or more processors areconfigured to command a second special effect output device. In someimplementations, the second special effect output device is incommunication with the second user. The one or more processors areconfigured to access a database comprising a plurality of pre-determinedtrigger phrases. The one or more processors are configured to determinewhen one or more of the plurality of pre-determined trigger phrases isdetected from either the first user or the second user. In someimplementations, the first user is in a different location than thesecond user. The one or more processors are configured to command thefirst special effect output device to play a special effect in responseto determining that the pre-determined trigger was detected from thefirst user.

In some implementations, one or more processors are further configuredto command the second special effect output device to play a specialeffect in response to determining that the pre-determined trigger phrasewas detected from the second user.

In some implementations, one or more processors are configured tocommand the second special effect output device to play the same specialeffect as outputted by the first special effect output device inresponse to determining that the pre-determined trigger phrase wasdetected from the first user.

One example aspect is directed to a system for providing special effectsin response to a user reading a text source. The system comprises one ormore processors. The one or more processors are configured to access adatabase comprising a plurality of pre-determined trigger phrases. Theone or more processors are configured to determine when a pre-determinedtrigger phrase is detected. The one or more processors are configured tocommand a special effect output device to play a special effect inresponse to determining that the pre-determined trigger was detected. Insome implementations, commanding the special effect comprises accessingsensory information about a remote location and outputting the specialeffect based, at least in-part, on the sensory information.

In some implementations, the sensory information comprises real-timesensory information.

In some implementations, the sensory information is updated at leastonce in a 24 hour period.

In some implementations, the sensory information is updated at leastonce in a 12 hour period.

In some implementations, the sensory information is updated at leastonce in a 1 hour period.

In some implementations, the sensory information comprises pre-recordedsensory information.

In some implementations, commanding the special effect comprisesaccessing sensory information comprising auditory information, visualinformation, environmental information, or combinations thereof about aremote location and outputting the special effect based, at leastin-part, on the sensory information.

In some implementations, commanding the special effect comprisesaccessing sensory information comprising auditory information about aremote location and outputting the special effect based, at leastin-part, on the auditory sensory information.

In some implementations, auditory information comprises sound detectedfrom the remote location.

In some implementations, commanding the special effect comprisesaccessing sensory information comprising visual information about aremote location and outputting the special effect based, at leastin-part, on the visual sensory information.

In some implementations, visual information comprises images, video, ora combination thereof recorded from the remote location.

In some implementations, commanding the special effect comprisesaccessing sensory information comprising environmental information abouta remote location and outputting the special effect based, at leastin-part, on the environmental sensory information.

In some implementations, the environmental information comprises weatherinformation.

In some implementations, the environmental information comprises weatherinformation comprising temperature information, pressure information,wind information, sun positioning information, or combinations thereof.

In some implementations, the environmental information comprises scentinformation.

In some implementations, the remote location is static.

In some implementations, the remote location is dynamic.

In some implementations, the remote location is separate and distinctfrom a special effect output location.

In some implementations, one or more processors are configured todetermine when a trigger phrase is detected via a speech recognitionengine.

In some implementations, one or more processors are configured todetermine when a trigger phrase is detected via an algorithm adapted todetermine a user's reading position.

In some implementations, one or more processors are configured todetermine when a trigger phrase is detected concurrently with outputtinga special effect.

One example aspect is directed to a system for providing special effectsin response to a user reading aloud a text source. The system comprisesone or more processors. The one of more processors are configured toaccess a first database comprising a first plurality of pre-determinedtrigger phrases. In some implementations, at least two of the firstplurality of pre-determined trigger phrases are in a different language.The one or more processors are configured to determine when apre-determined trigger phrase is detected via a speech recognitionalgorithm and determine the language of the detected pre-determinedtrigger phrase. The one or more processors are configured to access asecond database comprising a second plurality of pre-determined triggerphrases in response to determining that at least one of the firstplurality of pre-determined trigger phrases was detected. In someimplementations, the second plurality of pre-determined trigger phrasesis in the same language as the detected pre-determined trigger phrase inthe first database. The one or more processors are configured to commanda special effect output device to play a special effect in response todetermining that at least one of the second plurality of pre-determinedtrigger phrases was detected.

In some implementations, the one or more processors are furtherconfigured to access a third database comprising a third plurality ofpre-determined trigger phrases in response to determining that at leastone of the first plurality of pre-determining trigger phrases having thesame language as the language of the third plurality of pre-determinedtrigger phrases was detected.

In some implementations, the one or more processors are furtherconfigured to command the special effect output device to play thespecial effect in response to determining that at least one of the thirdplurality of pre-determined trigger phrases was detected.

In some implementations, the special effect is correlated to the textsource.

One example aspect is directed to a device. The device comprises amicrophone. The device comprises a speaker. The device comprises acommunication assembly adapted to communicate with an electronic device.In some implementations, the device is in the form of a bookmark.

In some implementations, the communication assembly is adapted towirelessly communication with the electronic device.

In some implementations, the communication assembly is adapted towirelessly communicate through Bluetooth, wifi, or combinations thereofwith the electronic device.

In some implementations, the electronic device comprises a mobiledevice, a computer, a smart home device, or any combinations thereof.

In some implementations, the device comprises an attachment portionadapted to attach to a book.

In some implementations, the device comprises an attachment portioncomprising a clamp assembly adapted to removably couple the device to abook.

In some implementations, the device further comprises a display capableof displaying video content.

In some implementation, the display comprises a touch-screen.

One example aspect is directed to a system for providing synchronizedspecial effects with a text source. The system comprises a text sourcecomprising a plurality of text characters. The system comprises aprocessor. The processor is configured to receive an auditory inputcomprising an auditory sound of one or more of the plurality of textcharacters. The processor is configured to initiate one or more specialeffects correlated to the text source in response to detecting anauditory sound of one or more of the plurality of text characters. Thesystem comprises a device, separate and distinct from the processor or adevice containing the processor. In some implementations, the devicecomprises an auditory receptor and a communication assembly. In someimplementations, the device is in the form of a bookmark. In someimplementations, the communication assembly is adapted for communicationwith the processor to relay the auditory input.

One example aspect is directed to a system for providing auditoryfeedback in response to one or more users reading aloud a text source.The system comprises one or more processors. The one or more processorsare configured to access a database comprising a plurality ofpre-determined trigger phrases. The one or more processors areconfigured to determine when one or more of the plurality ofpre-determined trigger phrases is detected via a speech recognitionalgorithm. The one or more processors are configured to command anauditory output device to play an auditory signal in response todetermining that the pre-determined trigger was detected. In someimplementations, the text source comprises a plurality of parts orcharacters. In some implementations, the one or more processors arefurther configured to selectively enable one or more of the charactersto be enabled.

One example aspect is directed to a system for providing data analyticsof a media. The system comprises one or more processors. The one or moreprocessors are configured to receive an auditory input comprising apre-determined trigger phrase. The one or more processors are configuredto determine when the pre-determined trigger is detected. The one ormore processors are configured to, in response to determining that thepre-determined trigger is detected, record and store analyticalinformation related to the environment at the location of detection. Insome implementations, the analytical information comprises time oftrigger detection, date of trigger detection, location of triggerdetection, volume of the auditory component of the media, distance fromauditory component of the media, temperature information, humidityinformation, engagement with electronic device information, orcombinations thereof.

One example aspect is directed to a system for providing dynamicinteraction with media content. The system comprises one or moreprocessors. The one or more processors are configured to receive anauditory input from a media comprising a pre-determined trigger phrase.The one or more processors are configured to determine when thepre-determined trigger phrase is detected. The one or more processorsare configured to, in response to determining that the pre-determinedtrigger is detected. The one or more processors are configured totransmit a signal to one or more mobile devices. The one or moreprocessors are configured to display content on the mobile devicecorrelated to the pre-determined trigger.

One example aspect is directed to a system for providing special effectsin response to a user reading aloud a text source. The system comprisesone or more processors. The one or more processors are configured toaccess a database comprising a plurality of pre-determined triggerphrases. The one or more processors are configured to determine when oneor more of the plurality of pre-determined trigger phrases is detectedvia a speech recognition algorithm. The one or more processors areconfigured to command a special effect output device to play a specialeffect in response to determining that the pre-determined trigger wasdetected. The one or more processors are configured to determine when asecond pre-determined trigger phrase is detected via a speechrecognition algorithm. The one or more processors are configured tocommand the special effect output device to manipulate the specialeffect in response to determine that the second pre-determined triggerphrase was detected.

What is claimed is:
 1. A system for providing a special effect associated with an auditory input, the system comprising: an electronic device configured to: receive an audible input from a user comprising speech of the user reading one or more portions of a text source, wherein the electronic device is physically separate and distinct from the text source; access a plurality of pre-determined triggers associated with the text source, wherein the plurality of pre-determined triggers comprise two or more active pre-determined triggers included in a current window of active triggers and one or more inactive pre-determined triggers that are not included in the current window, wherein the two or more active pre-determined triggers included in the current window of active triggers are determined based on a position within the text source of the one or more portions of the text source read by the user, wherein the electronic device is configured to cause one or more special effects upon matching the audible input to any of the two or more active pre-determined triggers but configured to not cause special effects upon matching the audible input to one of the one or more inactive pre-determined triggers; determine whether the audible input matches at least one active trigger of the two or more active pre-determined triggers via a speech recognition algorithm; in response to determining that the audible input matches a first active trigger of the two or more active pre-determined triggers, command a special effect device to output a first special effect associated with the text source, wherein the special effect device comprises an audio speaker, and wherein the first special effect comprises first audio content; in response to determining that the audible input matches the first active trigger of the one or more active pre-determined triggers, modify the current window of active triggers to deactivate the first active trigger to an inactive pre-determined trigger that is not included in the current window; and during playback of the first audio content by the special effect device: continuously listen for and receive additional audible input from the user; determine whether the additional audible input matches at least one active trigger via the speech recognition algorithm; in response to determining that the additional audible input matches a second active trigger, command the special effect device to output a second special effect associated with the text source at least partially concurrently with output of the first special effect, wherein the second special effect is different than the first special effect; and in response to determining that the additional audible input matches the second active trigger, modify the current window of active triggers to deactivate the second active trigger to be an inactive pre-determined trigger that is not included in the current window.
 2. The system of claim 1, wherein the text source is pre-existing.
 3. The system of claim 1, wherein the text source comprises a book.
 4. The system of claim 1, wherein the text source comprises a comic book.
 5. The system of claim 1, wherein the text source comprises a printed text source.
 6. The system of claim 1, wherein the text source comprises an electronically displayed text source.
 7. The system of claim 1, wherein the electronic device is configured to command the special effect device to begin outputting the first special effect before beginning to output the second special effect.
 8. The system of claim 1, wherein the electronic device is configured to command the special effect device to stop outputting the first special effect before stopping the output of the second special effect.
 9. The system of claim 1, wherein the electronic device is configured to command the special effect device to stop outputting the second special effect before stopping the output of the first special effect.
 10. The system of claim 1, wherein the second special effect comprises a second audio output, and the electronic device is configured to cause output of a third special effect comprising a light emission.
 11. The system of claim 1, wherein the electronic device is communicably coupled but physically distinct from the special effect device.
 12. The system of claim 1, wherein at least one of the first or the second special effects comprises animation.
 13. The system of claim 1, wherein at least one of the first or the second special effects comprises video.
 14. The system of claim 1, wherein at least one of the first or the second special effects comprises a picture.
 15. The system of claim 1, wherein in response to determining that the audible input matches the first active trigger of the one or more active pre-determined triggers the electronic device is configured to activate one of the one or more inactive pre-determined triggers.
 16. The system of claim 1, wherein the audible input from the user comprising speech of the user reading one or more portions of the text source is pre-recorded and electronically outputted.
 17. A system for providing a special effect associated with an auditory input, the system comprising: an electronic device configured to: receive an audible input from a user comprising speech of the user reading one or more portions of a text source; access a plurality of pre-determined triggers associated with the text source, wherein the plurality of pre-determined triggers comprise two or more active pre-determined triggers included in a current window of active triggers and one or more inactive pre-determined triggers that are not included in the current window, wherein the two or more active pre-determined triggers included in the current window of active triggers are determined based on a position within the text source of the one or more portions of the text source read by the user, wherein the electronic device is configured to cause one or more special effects upon matching the audible input to any of the two or more active pre-determined triggers but configured to not cause special effects upon matching the audible input to one of the one or more inactive pre-determined triggers; determine whether the audible input matches at least one active trigger of the two or more active pre-determined triggers via a speech recognition algorithm, wherein the one or more pre-determined triggers comprise active pre-determined triggers and inactive pre-determined triggers; in response to determining that the audible input matches the at least one active trigger of the two or more active pre-determined triggers, command one or more special effect devices to output a plurality of special effects associated with the text source; and in response to determining that the audible input matches the at least one active trigger of the two or more active pre-determined triggers modify the current window of active triggers to deactivate the at least one active trigger to an inactive pre-determined trigger that is not included in the current window; wherein the one or more special effect devices comprises an audio speaker, and the at least one of the one or more special effects includes audio content; wherein the plurality of special effects comprises a first special effect comprising an audio output and a second special effect comprising an audio output different from the first special effect; wherein the electronic device is configured to command the one or more special effect devices to output the second special effect at least partially concurrently with outputting the first special effect; wherein the at least one active trigger comprises a first active trigger that causes output of the first special effect and a second active trigger that causes output of the second special effect, wherein the first active trigger is at least partly different than the second active trigger; and wherein the electronic device is configured to determine when an additional active trigger phrase is detected via the speech recognition algorithm at least partly concurrently while the one or more special effect devices are outputting the plurality of special effects.
 18. A system for providing a special effect associated with an auditory input, the system comprising: an electronic device configured to: receive an audible input from a user comprising speech of the user reading one or more portions of a text source, wherein the text source comprises a printed book; access a plurality of pre-determined triggers associated with the text source, wherein the plurality of pre-determined triggers comprise two or more active pre-determined triggers included in a current window of active triggers and one or more inactive pre-determined triggers that are not included in the current window, wherein the two or more active pre-determined triggers included in the current window of active triggers are determined based on a position within the text source of the one or more portions of the text source read by the user, wherein the electronic device is configured to cause one or more special effects upon matching the audible input to any of the two or more active pre-determined triggers but configured to not cause special effects upon matching the audible input to one of the one or more inactive pre-determined triggers; determine whether the audible input matches at least one active trigger of the two or more active pre-determined triggers via a speech recognition algorithm; in response to determining that the audible input matches the at least one active trigger of the two or more active pre-determined triggers, command one or more special effect devices to output a plurality of special effects associated with the text source; and in response to determining that the audible input matches the at least one active trigger of the two or more active pre-determined triggers, modify the current window of active triggers to deactivate the at least one active trigger to an inactive pre-determined trigger that is not included in the current window; wherein the one or more special effect devices comprises an audio speaker and at least one of the plurality of special effects includes audio content; wherein the plurality of special effects comprises a first special effect comprising an audio output, and a second special effect comprising an audio output different from the first special effect; wherein the at least one active trigger comprises a first active trigger that causes output of the first special effect and a second active trigger that causes output of the second special effect, wherein the first active trigger is at least partly different than the second active trigger; and wherein the electronic device is configured to determine when an additional pre-determined trigger phrase is detected via the speech recognition algorithm at least partly concurrently while the one or more special effect devices are outputting the plurality of special effects. 